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PROCESS FOR DNA REPLICATION 

This application was supported by NIH Grant No. GM34557. The United 
States may have rights under this application. 

Tliis application claims priority from US Provisional Application No. 
60/1 18,703, wliich application is incorporated herein by reference for those countries where 
such incorporation is allowed. 



Background of the Invention 

Tliis application relates to a process for DNA replication, and to the 
application of this process for a variety of purposes. 

10 Replication of DNA and other nucleic acids is a complex natural phenomenon 

which occurs within all biological systems. To facilitate the exploitation of the resources 
represented in the diverse genetic materials of the world's organisms, however, it is desirable 
to be able to replicate selected DNA sequences under more controlled conditions, for 
example to produce increased amounts of one sequence. Such replication of selected DNA 

15 sequences is required for a great many applications of potential scientific and industrial 

significance, and has been accomplished by a variety of techniques. These include cloning of 
the DNA sequences into plasmids or genes, and replication of the plasmid using the DNA 
replication mechanisms of a host organism, and amplification techniques such as PGR or 
ligase amplification. Cloning is capable of replicating complete gene sequences, but requires 

20 the introduction of the sequences into a host organism, and the subsequent recovery of the 

duplicated DNA. PGR and similar amplification techniques offer increased flexibility, 
including the ability to introduce labels and/or sequence variations into the replicated DNA, 
and avoid the use of a host organism, but are limited in the length of the sequence which can 
be replicated. Thus, there remains a need for a methodology which will pemiit the 
. 25 replication of long DNA molecules, wliile providing the flexibility associated with PGR 

amplification. It is an object of the present invention to provide such a methodology. 
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SuiTimarv of the Invention 

The present invention provides a method for replicating DNA, and in 
particular for replicating large segments of DNA. In accordance with the invention, a primer 
is combined with a target DNA molecule to be replicated. The primer is designed to be at 
5 least partially homologous to a known site on the target DNA, and to create a D-loop when 

hybridized with that site. A replisome is then assembled at the D-loop, and this replisome 
creates a copy of the DNA, starting at the primer binding site. By utilizing two species of 
D-loop primers which bind to remote sites on the DNA flanking a region to be replicated, 
large sections of DNA can be replicated in a manner comparable to PCR. 
10 The replicated DNA can be analyzed to detect variations in the genetic 

sequence of the target, for linkage mapping and as a source of longer DNA molecules 
having a desiied sequence. 



Brief Description of the Drawing 
15 Fig. 1 shows the scheme used for making a double- stranded circular tenplate 

DNA molecule containing a D-loop, which was used to validate the concept of the 
invention. 



Detailed Description of the Invention 
20 The present invention provides a method for the controlled replication, 

generally in vitro, of selected regions of DNA. hi accordance with the invention, replication 

of a target region of a target DNA molecule is accomplished by: 

(a) introducing a D-loop into the target DNA molecule at a selected 

initiation point adjacent to the tai'get region; 
25 (b) assembling a replisome at the D-loop; and 

(c) providing DNA monomers (dNTPs) and ATP, whereby the target 

region is rephcated, ATP is preferably provided at concentrations in excess of about 1 mM. 

ATP is required because the formation of a processive DNA polymerase complex requires 

ATP hydrolysis and also because DnaB, die DNA helicase, requires concentration in excess 
30 of 1 mM to be maximally active. 
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Introduction of a D-loop at a selected initiation site in duplex DNA can be 
accomplished using an oligonucleotide primer which hybridizes with double-stranded DNA 
at a selected initiation site. The non-hybridized strand is displaced to create the D-loop. D- 
loop formation can be driven by the homologous pairing enzyme, RecA, as has been 
5 described in the literature. See, McEntee et al, Proc. NatHAcad. ScL (USA) 76: 2615-2619 

(1979), wliich is incorporated herein by reference. D-loop foraiation could also be driven 
by other methods, for example heating at a moderately high temperature (for example 75- 
80°C) may be enough to drive annealing, particularly in regions rich in A+T bases. 

The oligonucleotide primer which is used for generation of the D-loop 

10 generally has a length of from 20 to about 50 bases. The primer is selected to be 

substantially complementary to one of the two strands of the target DNA duplex at the 
initiation site. As used herein, the tenn "substantially complementary" refers to a primer 
which wiU hybridize with the target DNA duplex under conditions of moderately high 
stringency. However, it will be appreciated that RecA mediated hybridization, if employed, 

15 is an enzymatic strand-pairing reaction, and that conditions normally used for DNA-DNA 

hybridization (e.g. 0.6 M NaCl) would actually be inhibitory. Tlius the precise conditions 
corresponding to "moderately high stringency" may vary depending on the methodology 
used to drive the annealing. In a general sense, however, the term "substantially 
complementary" includes (1) primers which are perfectly complementary to the target DNA 

20 molecule, (2) primers which are complementary for most of their length, but which include 

one or several mismatches from perfect complementarity, although not enough mismatches 
to significantly reduce hybridization specificity; and (3) degenerate printers which include 
several bases at a given site to accommodate a multiplicity of common alleles in the target 
DNA. Tlie use of mismatched primers may result from the presence of a mutation in the 

25 initiation site, or the ixdsmatch may be intentionally selected for introduction of a desired 

sequence variation into the replicated DNA. 

The primers used m the invention may also include one or more non- 
hybridized regions for the purpose of introducing a desired additional sequence into the 
replicated DNA. For example, tliis additional sequence may be a sequence which introduces 

30 a restriction site near the end of the replicated DNA to facilitate insertion of the replicated 

copies into other DNA molecules. Preferred restriction sites wiQ be those recognized by 
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rare-cuttiiig restriction eiizjanes wliich generally recognize 8 -base sequences, or intron- 
honaing endonucleases such as Pl-Scel from yeast which recognizes a 31 -base pair sequence. 
This will reduce the likeliliood of cleavage occurring within the replicated DNA at other 
than the intended cleavage site. 

In an alternative embodiment of the invention useful with single-stranded 
templates, the primer used comprises a 3 - and a 5' region which are substantially 
complementary to portions of the target DNA template, and a central non-corrplementary 
region which fomis a D-loop when the primer is hybridized with the target DNA. A second 
primer wliich is complementary is used to form the invading strand of the D-loop. Similar 
variations for insertion of cleavage sites etc, may be incorporated in the structure of such 
primers. 

Tlie primers used in the method of the invention inay also include a 
detectable label or capture moiety. Suitable detectable labels and capture moieties are well 
known in the art as comparable materials are used in PCR, nucleic acid sequencing, and 
hybridization-based assays. Specific, non- limiting examples of suitable labels and/or capture 
moieties include fluorescent dyes such as fluorescein, Texas Red or cyanine dyes; enzyme 
labels such as alkaline phosphatase; and capturable labels such as bio tin. Nucleic acid tails 
which specifically interact with a known capture sequence can also be employed. 

In a preferred embodiment of the invention, die primer is combined with 
target double-stranded DNA under conditions suitable for hybridization and in the presence 
of the enzyme RecA, which results in the formation of a D-loop at the site of primer binding. 
Unlike common in vitro processes such as PCR, which utilize bacterial polymerases of 
inherently low processivity, the present invention utilizes replisomes. Replisomes are multi- 
protein associations which fonii at a replication fork and act in concert to replicate DNA. 
Replisomes provide much greater processivity than polymerases used for PCR. For 
example, the £. coli replisome can synthesize pieces of DNA at least as long as a megabase 
(1 X 10^ nucleotides). The fidelity of copying is also quite high, with the £, coli repUsome 
making fewer than 1 mistake in 10"^ nucleotides synthesized. Furthennore, unlike PCR, 
replisomes are substantially insensitive to regions of secondary structure in the DNA 
template. Tlius, utilization of replisomes offers numerous advantages over the use of 
polymerases. 
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Replisomes include proteins which perform a variety of functions. 
Replication of DNA using replisomes depends on an initial unwinding of the DNA duplex at 
an origin of replication, and the continued unwinding along the strands as the replication 
process proceeds. Tliis unwinding is carried out by DNA helicases. Tlie resultant regions of 
5 single-stranded DNA are stabilized by the binding of single- stranded DNA-biuding proteins 

which are also part of the replisome. Tlie stabilized single-stranded regions are then 
accessible to the enzymatic activities of polymerases enzymes required for replication to 
proceed. 

Replisomes have been shown to be substantially self assembling. Thus, when 
10 the necessary proteins are present under appropriate conditions, the replisome will assemble. 

We have found that tliis assembly will occur at a D-loop. A preferred combination of 
proteins for formation of a replisome iii accordance with the present invention includes the 
following proteins: 

PriA, PriB, PriC, DnaT, DnaB, DnaC (primosoiTial proteins); 
15 single-stranded DNA- binding protein (SSB); and 

DNA polymerase III holoenzyme (Pol III HE). 
An alternative combination utilizes the mutant protein DnaCSlO, (described below) in place 
of PriA, PriB, PriC and DnaT. 

The preparation and recovery of these various proteins is weU described in 
20 the art, including the ait cited below which is incorporated herein by reference. Pol III HE 

may be used iu a form recovered directly by purification from E. coli, or as a combination of 
Pol III* and the p subunit. Pol III HE may also be reconstituted from individually 
overexpressed and purified subunits. These subunits are a (DnaE), s (DnaQ), 6 (HolE), p 
(DnaN), x (DnaX, ftiU length), y (DnaX, truncated), 6 (HolA), 5' (HolB), % (HolC) and V|/ 
25 (HolD). Preparation of Pol III HE is described in US Patents Nos. 5,668,004 and 5,583,026 

which are incorporated herein by reference for those countries in which such incorporation is 
permitted. 

Replisomes have been found to initiate DNA replication at the site of a D- 
loop. Thus, the D-loop fomied by the interaction of die primer with the target DNA 
30 molecule serves as the initiation site for the replication process in accordance with the 

invention. Wlien appropriate nucleic acid monomers (i.e., deoxynucleotide triphosphates. 
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dATP, dCTP, dGTP and dTTP) and ATP are available, a copy of the strand of the DNA 
molecule to which the primer hybridizes is produced. The length of replicated inaterial 
which can be produced in tliis way is much greater than the length wliich can be produced 
using PGR or comparable techniques, with lengths in excess of 5000-500,000 bases being 
5 readily attainable. Thus, the method provides the ability to make copies of entire large 

genes, including both intron and exon sequences. 

As will be apparent to persons skilled in the art, a person making copies of 
DNA will generally be interested in obtaining those copies of a particular region of the 
DNA, which is referred to herein as the "target region." The target region may be a 

10 particulai' gene, or a particular portion of a gene depending on the use for which the copied 

DNA is intended. The ability to produce copies of very large numbers of bases changes the 
practical limits on the proximity between the primer and the target region from those which 
are usually observed in the PGR and comparable methods. Thus, while the initiation site 
must be "adjacent" to the target region, this means only that the initiation site must be close 

15 enough to and on the correct side of the target region such that a replisome assembled at the 

D-loop will copy the DNA of the target region. 

In a preferred embodiment of the invention, two primers are utilized. The 
first primer is as described above, and hybridizes with a first strand of a double stranded 
DNA duplex. Tlie second primer also is a substantially complementary oligonucleotide 

20 primer, but it hybridizes to the second strand of the DNA duplex at a second initiation site 

located on the other side of the target region. Tlius, the two primers flank the target region, 
in the same manner that PGR primers flank a region to be amplified. Further, the same 
principle which leads to amplification of just the region bounded by PGR primers, leads to 
creation of much larger pieces of replicated DNA spanning the region between the two 

25 initiation sites using the method of the invention, although the efficiency may not be as great 

as acliieved with PGR. Tliis reduced efficiency is less of a problem than one might expect, 
however, since the large size of the replicated DNA makes them inherently more detectable 
than small fragments. On the other hand, since the process of the invention works on 
double-stranded DNA, it is not necessary to separate the strands of the target and the newly 

30 replicated DNA before proceeding with the next cycle. 
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While the large size of the replicated DNA offers advantages for purposes of 
detection, it may also pose problems. Very large DNA molecules (i.e., those that are 
hundred of kilobases in length) are fragile, and may be broken if manipulated in simple 
solutions. Tlius, production of fragments of such lengths, and meaningful analysis of the 
5 lengths of such fragments may require that the reaction be perfomied in a supporting matrix, 

such as an agarose gel. Replicated DNA can be transferred out of the supporting matrix, for 
example tor introduction into a matrix for separation based on size by electrophoresis. 

DNA replicated in accordance with the invention may be utilized for a variety 
of purposes. First, the replicated DNA may be used as a source of genetic material to be 

10 spliced into still larger nucleic acid constructs, includiug plasmids, cosmids, viral vectors 

etc., to facilitate expression of the replicated DNA in a suitable host system Such splicing 
can be facilitated by the incorporation of restriction sites near then ends of the replicated 
DNA as discussed above. Wlien two primers are utilized, restriction sites can be introduced 
at both ends of the replicated DNA. 

15 Second, the replication of DNA in accordance with this method can be used 

as part of a method for detecting genomic rearrangements in a target DNA sequence. In 
such a method, a D-loop is introduced into the DNA at a selected initiation point, a 
replisome is assembled at the D-loop, and the DNA is copied to produce sufficient numbers 
of copies for analysis. Tlie copied product is analyzed to detect variations in size or 

20 organization of the copied material using size-specific separations, hybridization probes and 

other standard analytical tecliniques. It wiU be appreciated that the use of size-specific 
separations requires the production of a product of defined lengths, and thus wiU generally 
require the use of the two piimer embodiment discussed above. On the other hand, where 
the analysis involves the measurement of the interaction of the DNA with a labeled or 

25 inLimobilized probe, the replication of multiple copies of a single strand of the DNA, without 

amplification, may be sufficient. 

Tliird, the method can be used to facilitate linkage mapping. For example, 
the method can be used in the circumstance where two chromosomal markers are known to 
be near one another, but where the exact distance separating them is not known. D-loop 

30 oligonucleotide primers are synthesized for each marker for both the DNA strands. 

Combinations of the primers are used to replicate the region between the two markers, and 
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the size of the product foiTiied reflects the chromosomal distance between the two markers. 
The method may also be used to map unlinked genes, and markers such as RFLPs, SNEPs 
and ESTs. 

To demonstrate the ability of the replisomes to assemble at a D loop and 
replicate the DNA, we used a small bacteriophage DNA molecule as a model system as 
described in the following non-limiting examples. The conditions for replisome assembly and 
DNA replication can be extended to use with larger molecules, and with substantially 
complementary primers as discussed above. 



EXAMPLE 1 
Preparation of DNA Replication Proteins 

To prepare DnaCSlO, a dnaCSlO open reading frame was constructed by 
splicing overlap extension polymerase chain reaction and cloned into the Ndel site of the 
pETl IC overexpression plasinid (Novagen). Overexpression and purification of DnaCSlO 
was as for the wild type protein. 

PriA, PriB, PriC, DnaT, DnaB and DnaC were purified by the methods 
described in Marians, KJ. Methods EnzymoL 262: 507-521 (1995). SSB was purified using 
the procedures described in Minden and Marians, 7. BioL Chem. 260: 9316-9325 (1985). 
The DNA polymerase III holoenzjone was either reconstituted from Pol III* and p subunit 
as described by Wu et al. 7. BioL Chem. 267: 4030-4044 (1992) or from purified subunits as 
described in Marians et al., J. BioL Chem, 273: 2452-2457 (1998). 



EXAMPLE 2 

To validate the operabiUty of the mventive concept, a double- stranded 
circular template DNA was prepared in accordance with the steps shown in Fig. 1. A 100 
nt-long oligonucleotide primer (Seq. ID No. 1) was atinealed to flR408 vii*al DNA (Russell 
et al.. Gene 45: 333-339 (1986)). Tlae central 42 nt of tliis oligonucleotide are 
non-homologous with the template, thus forming a D-loop in the resulting heteroduplex. 
Incubation of the heteroduplex with DNA Polymerase III lioloenzyme in the presence of 
SSB and DNA monomers resulted in the extension of the primer and the formation of a 
nicked form II DNA with a 42 nt-lone bubble reeion. Durine the last two minutes of this 
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iiicubation, ddTTP and ddATP were introduced at concentrations 20-fold liigher than dTTP 
and dATP to ensure that complementary strand synthesis could not be extended further. 
After phenol extraction and ethanol precipitation, the DNA products were purified by 
electrophoresis through native agarose gels. Complete forni II bubble DNA was recovered 
5 from the gel and a [5 -^-P] minus strand oligonucleotide (Seq. ID. No. 2) was then annealed 

to the D loop form II template. Tlie template was then gel filtered through Biogel A5M to 
remove unamiealed oligonucleotide and unincorporated [y-^^P] ATP. 



EXAMPLE 3 

10 Reaction mixtures (12 /il) containing 50 mM Hepes-KOH (pH 8.0), 10 mM 

MgOAc, 10 mM DTT, 80 mM KCl, 200 /^g/ml bovine serum albumin, 2 mM ATP, 40 /^M 
dNTPs, 0.42 nM [^^P] form II D loop DNA template, 0.5 f^M SSB, 225 nM DnaC, 30 nM 
DNA polymerase III holoenzyme, PriA, PriB, PriC, DnaT and DnaB were incubated at 
37 °C for 10 minutes. To test the sufficiency of various combinations of proteins to replicate 

15 the template prepared m Example 2, reactions were also performed in which one of the 

proteins (PriA, PriB, PriC, DnaT, DnaC and DnaB) was omitted in each reaction mixture. 
As controls, template alone and ten^late with the holoenzyme alone were also evaluated. 
Reactions were terminated by the addition of EDTA to a concentration of 25 mM and 
NaOH to a concentration of 50 mM. The reaction products were evaluated by 

20 electrophoresis at 2 V/cm for 20 hours at room temperature thi'ough horizontal 0.7% 

alkaline agarose gels using 30 mM NaOH, 2 mM EDTA as the electrophoresis buffer. The 
gels were neutralized, dried and analyzed by autoradiography. 

The electrophoresis gels showed that incubation of the D-loop template, the 
seven primosomal proteins, SSB and DNA polymerase III holoenzyme resulted in extension 

25 of the invading strand oligonucleotide (42 nt, Seq. ID. No. 2) to the fuU length template size 

(6.4 kb). Tlie efficiency of the reaction varied, but generally 15-30% of the invading strand 
could be elongated to full length in a 10 minute incubation. Tlie reaction exhibited an 
absolute requirement for aU of the primosomal proteins except PriC. Omission of this 
protein resulted in a decrease in DNA synthesis to one-tIiii*d that of the complete reaction. 

30 This observation was similar to those reported for replication on different templates. Ng et 

al., y, BioL Chem. 271: 15642-15648 (1996). Some extension of the invading strand by the 
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holoeiizyme aloue could be observed, but tliis was suppressed by the presence of PriA. If 
the invading strand was omitted from the reaction, and [a-^"P] dATP was included, no DNA 
replication could be observed. 



5 EXAMPLE 4 

Because DNA heUcases were being introduced to the DNA during 
primosome assembly, extension of the invading strand could result from one of two 
processes: either (1) assembly of a bona fide replication fork at the D loop followed by 
elongation of the leading strand coupled with unwinding of the duplex DNA template, or (2) 

10 uncoupled unwinding of the template DNA leaving an oligonucleotide annealed to the viral 

single stranded DNA that could be elongated in a primer extension reaction by the 
polymerase. We previously showed that coupled replication fork action requires a protein- 
protein interaction between DnaB and the t subunit of the holoenzyme. Kim et al.. Cell 84: 
643-650 (1996). In the presence of this interaction, replication forks could move rapidly, at 

15 nearly 1000 nt/sec, whereas in its absence, the polymerase becomes stuck behind a slow- 

moving helicase and replication fork progression proceeds at only about 30 nt/sec. 

To evaluate the mechanism active in the replication of DNA in the method of 
the invention, the speed of elongation of the invading strand was assessed in the presence 
and absence of t using holoenzjone reconstituted from individual purified subunits. Ten 

20 second time points were taken from the start of the reaction, and the elongated products 

were examined on denaturing gels. Full length material could be observed in the presence of 
T after 10 seconds, whereas even after 60 seconds no fuU length material was observed in its 
absence. This corresponds to a rate of replication fork progression in the presence of t of 
600-700 nt/sec, similar to what has been observed in the past for other replication systems. 

25 Mok et al, /. Biol Chem, 262: 16644-16654 (1987). Thus, we conclude that bona fide 

replication fork assembly occurs at the D loop on the template in the presence of 
primosomal proteins, SSB and the holoenzyme. 



EXAMPLE 5 

30 All of the pheno types oipriA nuU mutations can be suppressed by mutated 

pi'iA alleles that encode PriA proteins that are no longer ATPases or DNA helicases, but still 
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catalyze primosome assembly, Zavitz et aL, /. Biol. Chem. 267: 6933-6940 (1992). These 
mutations are substitutions in the invariant Lys in the Walker A box nucleotide-binding 
motif. If the PriA-dependent replication fork assembly described here were relevant to what 
happened in the cell, we would expect these mutant proteins to substitute fully for wild-type 
5 PriA in the replication reaction. To test this, three mutant proteins, having the K230R, 

K230A and K230D substitutions were tested. All three supported replication on the D loop 
to a greater extent than the wild-type protein. Tliis same type of improved activity in the 
mutant protems has been observed in other systems (Zavitz, supra), and may arise because 
the mutant proteins remain bound to the site of DNA binding, providing a better target than 
10 the wild-tN'pe protein that can move off the site because of its helicase activity. 

EXAMPLE 6 

E. coli strains carrying phA mutations are very difficult to grow. Tliey are 
rich-media sensitive, form huge filaments, and have a viability roughly one-hundredth that of 

15 the wild-type. Sandler et al.. Genetics 143: 5-13 (1996); Nurse et al., / Bacteriol 6686- 

6693 (1991); Masai et al, EMBO J. 13: 5338-5345 (1994). Suppressor mutations that 
restore viability, as well as ablate constitutive induction of the SOS response and the defects 
in homologous repair of UV-damaged DNA, arise overnight after transduction of the 
priA2:kan allele into fresh recipient cells. The mutations map to dnaC, (Sandler, supra), 

20 DnaC forms a complex with DnaB in solution (Wicker et al, Proc. Natl Acad ScU (USA) 

72: 921-925 (1975), and is required for the efficient transfer of DnaB to DNA in the 
presence of other replication protein. Marians et al., Ann, Rev, Biochem, 61: 673-719 
(1992). In order to assess the biochemical properties of these altered DnaC proteins, one 
such suppressor allele, dnaCSlO, was molecularly cloned into an expression plasmid and the 

25 mutant protein purified as described in Example 7, infra. 

Strains carrying dnaCSlO no longer require PriA for viability. TTiis suggests 
that if the essential role for PriA in cellular metabolism was to catalyze assembly of 
replication forks at recombination intermediates, DnaCSlO must be able to bypass the 
requirement for PriA to recognize the D loop and nucleate the assembly of a primosome. 

30 Accordingly, we tested whether DnaCSlO alone could direct transfer of DnaB to the D loop 

template DNA. 
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In the presence of SSB and the holoenzyme, the combination of wild-type 
DnaC and DnaB did not support elongation of the invading strand of the D loop. On the 
other hand, DnaCSlO was clearly able to load DnaB to the D loop on the template in the 
absence of the other prime somal proteins, as evidenced by the elongation of the invading 
5 strand to full length. Thus, the E176G substitution in DnaCSlO represents a true gain of 

function mutation that allows bypass of the DnaB loading pathway that involves PriA, PriB, 
PriC and DnaT and permits a reduction in the number of proteins necessary for the practice 
of the present invention. 

Interestingly, the relative efficiencies of the replication reaction catalyzed in 

10 the presence of DnaCSlO and DnaB varied compared to the reaction catalyzed by the 

complete set of primosomal proteins. At 80 inM KCl, the DnaCSlO reaction was 5- to 10- 
fold more efficient. However, at 600 mM potassium glutamate, the reaction catalyzed by 
the complete set of proteins was more efficient by a factor of 2, Wliile not intending to be 
bound by a particular mechanism, tliis difference may arise from differences in the relative 

15 stability of intermediate complexes that are formed during the loading of DnaB to DNA. 

EXAMPLE 7 

Construction of Plasmid pETl Ic-dnaCSJO — A dnaCSlO open reading 
frame (ORF) was made by two-step overlapping polymerase chain reaction (PCR) Morton 
20 et aL, Gene 11: 61-68 (1989). The N-terminal coding region of dnaCSlO was PCR amplified 

using plasmid pETllc-rf/iaC (Marians, K.J, Methods EnzymoL 262:m 507-521 (1995)) as a 
tenqjlate and two flanking primers: 

(i) the Ndel primer (Seq. ID No. 3), wliich carries a Ndel site at the dnaC initiator codon, 
and 

25 (ii) the Agel' primer (Seq. ID. No. 4), wliich carries the designed point mutation (E176G, 

GAA-GGT). The C-terminal coding region of dnaCSIO was also PCR amplified using 
plasmid pETl Ic-dnaC as a template and two different flanking primers: 

(i) the Agel prhner (Seq. ID No. 5), wliich is complementary to the Agel' primer and 

(ii) the BamHI primer (Seq. ID No. 6), wliich carries a fimnHI site just downstream of the 
30 dnaC stop codon. Tliese overlapping N- and C-terminal firaginents were gel purified after 

PCR and further PCR extended and amplified with the two flanking Ndel and BamHI 
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primers. The gel purified dnaCSlO ORF fragment was digested with Nd&l and BcivaiXl and 
Ugated with Nd&l- and 5amHI-digested pETl Ic plasmid DNA to give pETl lc-dnaC810. 

Purification ofDnaCSlO — Because of the extreme overproduction, 
DnaCSlO was followed during purification by SDS-PAGE. BL21(DE3)pLysS carrying 
pETl lc-dnaC810 was grown in 12 1 L Broth (Mainatis et al.. Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY 
(1982)) containing 0.4% glucose and 300 mg/nd ampicillin to OD^,, = 0.4 and then induced 
in the presence of 1 mM IPTG for 3 h. CeUs were chilled, pelleted by centrifugation, and 
resuspended in 50 mM Tris-HCl (pH 8.4 at 4 °C) and 10% sucrose. The ceU suspension (50 
ml) was adjusted to 150 mM KCl, 20 mM EDTA, 5 mM ditliiothreitol, 0.02% lysozyme, 
and 0. 1% Brij 58 and mcubated at 0 °C tor 10 min. This suspension was centrifuged at 
100,000 X g tor 1 h (Sorvall T865 rotor). Tlie supernatant (fraction 1, 65 ml, 3510 mg 
protein) was adjusted to 0.04% polymin P by dropwise addition of a 1% solution. The 
precipitate was removed by centrifugation at 47,000 x g in a Sorvall SS-34 rotor for 30 min. 
The supernatant was fiirther subjected to (NH4)2S04fi-actionation (50% saturation) by the 
addition of solid. The resulting protein pellet was collected by centrifugation at 47,000 x g 
in a Sorvall SS-34 rotor for 30 min. Tlie protein pellet was resuspended in 8 ml of buffer A 
[50 mM Tris-HCl (pH 7.5 at 4 °C), 1 mM EDTA, 5 mM dithiothreitol, 20% glycerol, 0.017c 
Brij 58] + 50 mM NaCl to give fraction 2 (13 ml, 1 108 mg protein). Fraction 2 was dialyzed 
against 2 1 of buffer A + 50 mM NaCl for 12 h and then loaded onto a 100-ml DEAE- 
cellulose column (4 cm x 20 cm) that had been equilibrated previously with buffer A + 50 
mM NaCl. The column was washed with 200 ml of buffer A + 50 mM NaCl. Fractions (15 
ml) of the flow-through and wash that contained protein were pooled to give fraction 3 (8 1 
ml, 363 mg protein). Fraction 3 was loaded directly onto a 35-ml SP-Sepharose FF column 
(formed in a 60-ml disposable syringe) that had been equilibrated previously with buffer A + 
50 mM NaCl. Tlie column was washed with 200 ml of buffer A + 50 mM NaCl and protein 
was then eluted with a 350-ml linear gradient of 50-300 mM NaCl in buffer A. DnaCSlO 
eluted at 175 mM NaCI (fraction 4, 24 ml, 25 mg protein). Fraction 4 was then loaded 
dii-ectly onto a 6-iTa hydroxylapatite column (packed in a 10-ml disposable syringe) diat had 
been equilibrated previously with buffer A + 200 mM NaCl. The column was washed with 
12 ml of equilibration buffer and protein was eluted with a 60-ml linear gradient of 0-400 
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mM (NH4)2SO-;m buifer A + 200 mM NaCl. DnaCSlO eluted at 150 mM (NH4)2S04to give 
fraction 5 (5.2 iiil, 16.5 mg protein). Fraction 5 was concentrated bydialyzing against buffer 
A + 50 iTiM NaCl + 30% polyethylene glycol 20,000 and loaded onto a 125-ml Superdex- 
200 FPhC column that had been equilibrated with buffer A + 50 mM NaCl. The column was 
eluted at 1 ml/min. Fractions (1 ml) containing DnaCSlO were pooled to give fraction 6 (7.5 
ml, 9.2 mg protein). Fraction 6 was then loaded onto a 3-ml phosphocellulose column that 
had been equilibrated with buffer A + 50 mM NaCl. The column was washed with 6 ml of 
equilibration buffer and protein was eluted with a 60-ml linear gradient of 50-400 mM NaCl 
in buffer A. DnaCSlO eluted at 250 mM NaCl (Fraction 7, 3.5 ml, 5.2 mg protein). 
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Sequence Listing 

(Seq. ID No. 1) ACATACATAA AGGTGGCAAC GCCATTCGAA 

ATGAGCTCCA TATGCTAGCT AGGGAGGCCC 
CCGTCACAAT CAATAGAAAA TTCATATGGT TTACCAGCGC 
ATATAAAAGA AACGCAAAGA CACCACGGAA 
TAAGTTTATT TT 

TAATGCAGGC CATATGAAAA ACGTTGGCG A CCTG 
TCGTATTTCG AACCGGTCTG CACG 
CGTGCAGACC GGTTCGAAAT ACGA 
TTAAGCACTG GGATCCTTAA TACTCTTTAC CTGTTAC 



(Seq. 


ID No. 


2) 


(Seq. 


ID No. 


3) 


(Seq. 


ID No. 


4) 


(Seq. 


ID No. 


5) 


(Seq. 


ID No. 


6) 
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CLAIMS 

1 1 . A method for replication of a target region of a target DNA molecule 

2 comprising the steps of: 

3 (a) introducing a D-loop into the target DNA molecule at a first initiation 

4 point adjacent to the target region; 

5 (b) assembling a replisome at the D-Ioop; and 

6 (c) providing DNA monomers and ATP to the replisome, whereby the 

7 target region is reproduced. 

1 2. The method of claim 1, wherein the target DNA molecule is a duplex 

2 DNA. 

1 3. The method of claim 2, wherein the step of introducing a D-loop is 

2 perforaied by hybridizing the duplex DNA molecule with a first oligonucleotide primer 

3 which is substantiaUy complementary to the first initiation site. 

1 4. The method of claim 3, wherein the first oligonucleotide primer has a 

2 length of from 20 to 50 bases. 

1 5. The method of claim 3, wherein the first oligonucleotide primer 

2 comprises a detectable label or capture moiety. 

1 6. The method of claim 3, further comprising the step of introducing a 

2 second D-loop by hybridizing the duplex DNA molecule with a second oligonucleotide 

3 primer which is substantially coinplementaiy to a second initiation site, said target region 

4 lying between the first and second initiation sites. 

1 7. The method of claim 6, wherein the tirst and second oligonucleotide 

2 primers each have a length of fi-om 20 to 50 bases. 
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1 8. Tlie method of claim 6, wherein at least one of the oligonucleotide 

2 primers comprises a detectable label or capture moiety. 

1 9. The method of claim 6, wherein the replication is performed in a 

2 supporting matrix. 



1 10. The method of claim 6, wherein the replisome is assembled via the 

2 action of primosomal proteins, single-stranded DNA-binding protein and the DNA 

3 polymerase III holoenzyme. 

1 11. Tlie method of claim 10, wherein the primosomal proteins includes a 

2 mutant PriA protein wliich lacks ATPase and helicase functionality. 

1 12. Tlie method of claim 2, wherein the replication is performed in a 

2 supporting matrix. 

1 13. The method of claim 1, wherein the replication is performed in a 

2 supporting matrix. 



1 14. Tlie method of claim 1, wherein the replisome is assembled via the 

2 action of primosomal proteins, single-strand binding protein and holoenzyme III. 

1 15. The method of claim 14, wherein the primosomal proteins includes a 

2 mutant PriA protein which lacks ATPase and helicase functionality. 
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PCT/USOO/04445 



<110> Marians, Kenneth J. 
Joing, Liu 

<120> Process for DNA Replication 

<130> MSK,P-041-WO 

<140> 
<141> 

<150> 60/118,703 
<151> 1999-02-04 

<160> 6 

<170> Patentin Ver . 2.1 

<210> 1 
<211> 100 
<212> DNA 

<213> Escherichia coli 
<220> 

<223> primer 
<400> 1 

acatacataa aggtggcaac gccattcgaa atgagctcca tatgctagct agggaggc 
ccgtcacaat caatagaaaa ttcatatggt ttaccagcgc 



<210> 2 
<211> 42 
<212> DNA 

<213> Escherichia coli 
<220> 

<223> minus strand oligonucleotide 
<400> 2 

atataaaaga aacgcaaaga caccacggaa caagtttatt tt 



<210> 3 
<211> 34 
<212> DNA 

<213> Escherichia coli 
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<220> 

<223> Ndel primer 
<400> 3 

taatgcaggc catatgaaaa acgtcggcga cctg 



<210> 4 
<211> 24 
<212> DNA 

<213> Escherichia coli 
<220> 

<223> Agel' primer 
<400> 4 

tcgtatttcg aaccggtctg cacg 



<210> 5 
<211> 24 
<212> DNA 

<213> Escherichia coli 
<220> 

<223> Agel primer 
<400> 5 

cgtgcagacc ggttcgaaat acga 



<210> 6 
<211> 37 
<212> DNA 

<213> Escherichia coli 
<220> 

<223> BamHI primer 
<400> 6 

ttaagcactg ggatccttaa tactctttac ctg 
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PROCESS FOR DNA REPLICATTON 

This application was supported by NIH Grant No. GM34557. The United 
States may have rights under this application. 

Tliis application claims priority from US Provisional Application No. 
60/1 18 J03, wliich application is incorporated herein by reference for those countries where 
such incorporation is allowed. 

Background of the Invention 

Tliis application relates to a process for DNA replication, and to the 
application of this process for a variety of puiposes. 

Replication of DNA and other nucleic acids is a complex natural phenomenon 
which occurs within all biological systems. To facilitate the exploitation of the resources 
represented in the diverse genetic materials of the world's organisms, however, it is desirable 
to be able to replicate selected DNA sequences under more controlled conditions, for 
example to produce increased amounts of one sequence. Such replication of selected DNA 
sequences is required for a great many applications of potential scientific and industrial 
significance, and has been accomplished by a variety of techniques. These include cloning of 
the DNA sequences into plasmids or genes, and replication of the plasmid using the DNA 
replication mechanisms of a host organism, and amplification techniques such as PGR or 
ligase amplification. Cloning is capable of replicating complete gene sequences, but requires 
the introduction of the sequences into a host organism, and the subsequent recovery of the 
duplicated DNA. PGR and similar amplification techniques offer increased flexibility, 
including the ability to introduce labels and/or sequence variations into the replicated DNA. 
and avoid the use of a host organism, but are limited in the length of the sequence which can 
be replicated. Thus, there remams a need for a methodology which will permit the 
replication of long DNA molecules, wliile providing the flexibility associated with PGR 
amplification. It is an object of the present invention to provide such a methodology. 
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Suimiarv of the Invention 

The present invention provides a method for replicating DNA, and in 
particular for replicating large segments of DNA. In accordance with the invention, a primer 
is combined with a target DNA molecule to be replicated. The primer is designed to be at 
least partially homologous to a known site on the target DNA, and to create a D-loop when 
hybridized witli that site. A replison^ is then assembled at the D-loop, and this replisome 
creates a copy of the DNA. starting at the primer binding site. By utilizing two species of 
D-loop primers which bind to renciote sites on the DNA flanking a region to be replicated, 
large sections of DNA can be replicated in a manner conparable to PCR. 

The replicated DNA can be analyzed to detect variations in the genetic 
sequence of the target, tor linkage mapping and as a source of longer DNA molecules 
having a desii'ed sequence. 

Brief Description of the Drawing 

Fig. 1 shows the scheme used for making a double-stranded circular XtxapldXt 
DNA molecule containing a D-loop, which was used to validate the concept of the 
invention. 

Detailed Description of the Invention 

The present invention provides a method for the controlled replication, 
generally in vitro, of selected regions of DNA. In accordance with the invention, replication 
of a target region of a target DNA molecule is acconylished by; 

(a) introducing a D-loop into the target DNA molecule at a selected 
initiation point adjacent to the target region; 

(b) assembling a replisome at the D-loop; and 

(c) providing DNA monomers (dNTPs) and ATP, whereby the target 
region is replicated. ATP is preferably provided at concentrations in excess of about 1 mM. 
ATP is required because the formation of a processive DNA polymerase conplex requires 
ATP hydrolysis and also because DnaB, the DNA helicase, requires concentration in excess 
of 1 mM to be maximally active. 



SUBSTITUTE SHEET (RULE 26) 



wo 00/46408 PCT/USOO/04445 

-3- 



Introduction of a D-loop at a selected initiation site in duplex DNA can be 
accomplished using an oligonucleotide primer which hybridizes with double-stranded DNA 
at a selected initiation site. The non-hybridized strand is displaced to create the D-loop. D- 
loop formation can be driven by the homologous pairing enzymye, RecA, as has been 
5 described in the literature. See, McEntee et al, Proc. Nat'l Acad. Sci, (USA) 76: 2615-2619 
(1979), which is incorporated herein by reference. D-loop formation could also be driven 
by other methods, tor example heating at a moderately high ten^jerature (for example 75- 
80**C) may be enough to drive annealing, particularly in regions rich in A+T bases. 

The oligonucleotide primer which is used for generation of the D-loop 

10 generally has a length of from 20 to about 50 bases. The primer is selected to be 

substantially complementary to one of the two strands of the target DNA duplex at the 
initiation site. As used herein, the term "substantiaDy complementary" refers to a primer 
which will hybridize with the target DNA duplex under conditions of moderately high 
stringency. However, it will be appreciated that RecA mediated hybridization, if employed, 

15 is an enzymatic strand-pairing reaction, and that conditions normally used for DNA-DNA 

hybridization (e.g. 0.6 M NaCl) would actually be inhibitory. Thus the precise conditions 
corresponding to **moderately high stringency" may vary depending on the methodology 
used to drive the annealing. In a general sense, however, the terra "substantially 
conplementary" includes (1) primers which are perfectly conplementary to the target DNA 

20 molecule, (2) primers which are complementary for most of their length, but which include 
one or several mismatches from perfect complementarity, although not enough nrdsmatches 
to significantly reduce hybridization specificity; and (3) degenerate primers which include 
several bases at a given site to accommodate a multiplicity of common alleles in the target 
DNA. The use of mismatched primers may result from the presence of a mutation in the 

25 initiation site, or the mismatch may be intentionally selected for introduction of a desired 

sequence variation into the replicated DNA. 

The primers used in the invention may also include one or more non- 
hybridized regions for the purpose of introducing a desired additional sequence into the 
replicated DNA. For exaii^le, tliis additional sequence may be a sequence which introduces 

30 a restriction site near the end of the replicated DNA to facilitate insertion of the repUcated 

copies into other DNA molecules. Preferred restriction sites will be those recognized by 
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rare-cutting restriction enzymes wliich generally recognize 8-base sequences, or intron- 
homing endonucleases such as Pl-Scel from yeast which recognizes a 31-base pair sequence. 
This will reduce the likelihood of cleavage occurring within the replicated DNA at other 
than the intended cleavage site. 

5 In an alternative embodiment of the invention useful with single-stranded 

templates, the primer used coin)rises a 3 - and a 5' region which are substantially 
complementary to portions of the target DNA teit^late, and a central non-complementary 
region which forms a D-loop when the primer is hybridized with the target DNA. A second 
primer wliich is complementary is used to form the invading strand of the D-loop. SimUar 
10 variations for insertion of cleavage sites etc, may be incorporated in the structure of such 
primers. 

Tlie primers used in the method of the invention may also include a 
detectable label or capture moiety. Suitable detectable labels and capture moieties are well 
known in the art as comparable materials are used in PGR, nucleic acid sequencing, and 

15 hybridization-based assays. Specific, non-limiting examples of suitable labels and/or capture 
moieties include fluorescent dyes such as fluorescein, Texas Red or cyanine dyes; enzyme 
labels such as alkaline phosphatase; and capturable labels such as biotin. Nucleic acid tails 
which specificaDy interact with a known capture sequence can also be employed. 

In a preferred embodiment of the invention, the printer is combined with 

20 target double-stranded DNA under conditions suitable for hybridization and in the presence 
of the enzyme RecA, which results in the formation of a D-loop at the site of primer binding. 
Unlike common in vitro processes such as PGR, which utilize bacterial polymerases of 
inherently low processivity, the present invention utilizes replisomes. Replisomes are multi- 
protein associations which form at a replication fork and act in concert to replicate DNA. 

25 Replisomes provide much greater processivity than polymerases used for PGR. For 

example, the £. coli replisome can synthesize pieces of DNA at least as long as a megabase 
(1 X 10* nucleotides). The fidelity of copying is also quite high, with the £, coli replisome 
making fewer than 1 mistake in 10** nucleotides synthesized. Furthermore, unlike PGR, 
replisomes are substantially insensitive to regions of secondary structure in the DNA 

30 template. Tlaus, utilization of replisomes offers numerous advantages over the use of 
polymerases. 
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Replisomes include proteins which perform a variety of functions. 
Replication of DNA using replisomes depends on an initial unwinding of the DNA duplex at 
an origin of replication, and the continued unwinding along the strands as the replication 
process proceeds. Tliis unwinding is carried out by DNA helicases. The resultant regions of 
5 single-stranded DNA are stabilized by the binding of single-stranded DNA-binding proteins 

which are also part of the replisome. Hie stabilized single-stranded regions are then 
accessible to the enzymatic activities of polymerases enzymes required for replication to 
proceed. 

Replisomes have been shown to be substantially self assembling. Thus, when 
10 the necessary proteins are present under appropriate conditions, the replisome will assemble. 

We have found that tliis assembly will occur at a D-loop. A preferred combination of 
proteins for formation of a replisome in accordance with the present invention includes the 
following proteins: 

PriA, PriB, PriC, DnaT, DnaB, DnaC (primosomal proteins); 
15 single-stranded DNA-binding protein (SSB); and 

DNA polymerase III holoenzyme (Pol III HE). 
An alternative combination utilizes the mutant protein DnaCSlO, (described below) in place 
of PriA, PriB, PriC and DnaT. 

The preparation and recovery of these various proteins is well described in 
20 the art, including the art cited below which is incorporated herein by reference. Pol III HE 

may be used in a form recovered directly by purification from E. coli, or as a combination of 
Pol III* and the p subunit. Pol III HE may also be reconstituted from individually 
overexpressed and purified subunits. These subunits are a (DnaE), e (Dna<2), 0 (HolE), p 
(DnaN), x (DuaX, fuU length). 7 (DnaX, truncated), 5 (HolA), 5' (HolB), x(HolC) and v|/ 
25 (HolD). Preparation of Pol ffl HE is described in US Patents Nos. 5,668,004 and 5,583,026 

which are incorporated herein by reference for those countries in which such incorporation is 
permitted. 

Replisomes have been found to initiate DNA replication at the site of a D- 
loop. Tims, the D-loop fonned by the interaction of the primer with the target DNA 
30 molecule sei-ves as the initiation site for the replication process in accordance with the 

invention. Wlien appropriate nucleic acid monomers (i,e., deoxynucleotide triphosphates, 
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dATP, dCTP, dGTP and dTTP) and ATP are avaUable, a copy of the strand of the DNA 
molecule to which the primer hybridizes is produced. The length of replicated material 
which can be produced in tliis way is much greater than the length which can be produced 
using PGR or comparable techniques, with lengths in excess of 5000-500,000 bases being 
5 readily attainable. Thus, the method provides the ability to make copies of entire large 
genes, including both intron and exon sequences. 

As will be apparent to persons skilled in the art, a person making copies of 
DNA will generally be interested in obtaining those copies of a particular region of the 
DNA, which is referred to herein as the "target region." The target region may be a 

10 particular gene, or a particular portion of a gene depending on the use for which the copied 
DNA is intended. The ability to produce copies of very large numbers of bases changes the 
practical limits on the proximity between the primer and the target region from those which 
are usuaUy observed in the PGR and comparable methods. Thus, while the initiation site 
must be **adjacent" to the target region, this means only that the initiation site must be close 

15 enough to and on the correct side of the target region such that a replisome assembled at the 

D-loop will copy the DNA of the target region. 

In a preferred embodiment of the invention, two primers are utilized. The 
first primer is as described above, and hybridizes with a first strand of a double stranded 
DNA duplex. The second primer also is a substantially complementary oligonucleotide 

20 primer, but it hybridizes to the second strand of the DNA duplex at a second initiation site 
located on the other side of the target region. Thus, the two primers flank the target region, 
in the same manner tliat PGR primers flank a region to be amplified. Further, the same 
principle which leads to amplification of just the region bounded by PGR primers, leads to 
creation of much larger pieces of replicated DNA spanning the region between the two 

25 initiation sites using the method of the invention, although the efficiency may not be as great 

as acliieved with PGR. Tliis reduced efficiency is less of a problem than one might expect, 
however, since the large size of the replicated DNA makes them inherently more detectable 
than small ft-agments. On the other hand, since the process of the invention works on 
double- stranded DNA, it is not necessary to separate the strands of the target and the newly 

30 replicated DNA before proceeding with the next cycle. 
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While the large size of the replicated DNA offers advantages for purposes of 
detection, it may also pose problems. Very large DNA molecules (i.e., those that are 
hundred of kilobases in length) are fragile, and may be broken if manipulated in simple 
solutions. Tims, production of fragments of such lengths, and meaningful analysis of the 
5 lengths of such fragments may require that the reaction be performed in a supporting matrix, 
such as an agarose gel. Replicated DNA can be transferred out of the supporting nfiatrix, for 
exanqjle for introduction into a matrix for separation based on size by electrophoresis. 

DNA replicated in accordance with the invention may be utilized for a variety 
of purposes. First, the replicated DNA may be used as a source of genetic material to be 

10 spliced into still larger nucleic acid constructs, including plasmids, cosmids, viral vectors 

etc., to facilitate expression of the replicated DNA in a suitable host system Such splicing 
can be facilitated by the incorporation of restriction sites near then ends of the replicated 
DNA as discussed above. When two primers are utilized, restriction sites can be introduced 
at both ends of the replicated DNA. 

15 Second, the replication of DNA in accordance with this method can be used 

as part of a method for detecting genomic rearrangements in a target DNA sequence. In 
such a method, a D-loop is introduced into the DNA at a selected initiation point, a 
replisome is assembled at the D-loop, and the DNA is copied to produce sufficient numbers 
of copies tor analysis. Tlie copied product is analyzed to detect variations in size or 

20 organization of the copied material using size-specific separations, hybridization probes and 
other standard analytical tecliniques. It will be appreciated that the use of size-specific 
separations requires the production of a product of defined lengths, and thus will generally 
require the use of the two primer embodiment discussed above. On the other hand, where 
the analysis involves the measurement of the interaction of the DNA with a labeled or 

25 immobilized probe, the replication of multiple copies of a single strand of the DNA, without 
amplification, may be sufficient. 

Third, the method can be used to facilitate Unkage mapping. For example, 
the method can be used in the circumstance where two chromosomal markers are known to 
be near one another, but where the exact distance separating them is not known, D-loop 

primers are synthesized for each marker for both the DNA strands. 
Combinations of the primers are used to replicate the region between the two markers, and 
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the size of the product formed reflects the chromosomal distance between the two markers. 
The method may also be used to map unlinked genes, and markers such as RFLPs, SNIPs 
and ESTs. 

To demonstrate the ability of the replisomes to assemble at a D loop and 
5 replicate the DNA, we used a small bacteriophage DNA molecule as a model system as 

described in the following non-limiting examples. The conditions for replisome assembly and 
DNA replication can be extended to use with larger molecules, and with substantially 
con^lementary primers as discussed above. 

10 EXAMPLE 1 

Preparation of DNA Replication Proteins 
To prepare DuaC810» a dnaCSlO open reading frame was constructed by 
splicing overlap extension polymerase chain reaction and cloned into the Ndel site of the 
pETl IC overexpression plasmid (Novagen). Overexpression and pxuification of DnaCBlO 
15 was as for the wUd type protein. 

PriA, PriB. PriC, DnaT, DnaB and DnaC were purified by the methods 
described in Marians, KJ. Methods EnzymoL 262; 507-521 (1995). SSB was purified using 
the procedures described in Minden and Marians, J. Biol Chem, 260: 9316-9325 (1985). 
The DNA polymerase III holoenzjine was either reconstituted from Pol III* and p subunit 
20 as described by Wu et al. /. BioL Chem, 267: 4030-4044 (1992) or from purified subunits as 

described in Marians et al, J. BioL Chem. 273: 2452-2457 (1998). 

EXAMPLE 2 

To validate the operability of the inventive concept, a double-stranded 
25 circular template DNA was prepared in accordance with the steps shown in Fig. 1. A 100 
nt-long oligonucleotide primer (Seq. ID No. 1) was annealed to flR408 viral DNA (Russell 
et al., Gene 45: 333-339 (1986)). Tlie central 42 ut of this oligonucleotide are 
non-homologous with the template, thus forming a D-loop m the resulting heteroduplex. 
Incubation of the heteroduplex with DNA Polymerase III holoenzyme in tlie presence of 
30 SSB and DNA monomers resulted in the extension of the prinner and the formation of a 
nicked form II DNA with a 42 nt-long bubble region. During the last two minutes of this 
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incubation, ddTTP and ddATP were introduced at concentrations 20-fold higher than dTTP 
and dATP to ensure that con^lementary strand synthesis could not be extended further. 
After phenol extraction and ethanol precipitation, the DNA products were purified by 
electrophoresis through native agarose gels. Complete form II bubble DNA was recovered 
5 from the gel and a [5 -^^P] minus strand oligonucleotide (Seq. ID. No. 2) was then annealed 

to the D loop form II template, Tlie template was then gel filtered through Biogel ASM to 
remove unannealed oligonucleotide and unincorporated [y-^^P] ATP. 

EXAMPLE 3 

10 Reaction mixtures (12 //I) containing 50 mM Hepes-KOH (pH 8.0). 10 noM 

MgOAc, 10 mM DTT, 80 mM KCl, 200 ixg/vri bovine serum albumin, 2 mM ATP, 40 fM 
dNTPs, 0.42 nM [^^P] form II D loop DNA template, 0.5 mM SSB, 225 nM DnaC, 30 nM 
DNA polymerase III holoenzyme, PriA, PriB, PriC, DnaT and DnaB were incubated at 
37**C for 10 minutes. To test the sufficiency of various combinations of proteins to replicate 

15 the tenqjlate prepared in Example 2, reactions were also performed in which one of the 

proteins (PriA, PriB, PriC, DnaT, DnaC and DnaB) was omitted in each reaction mixture. 
As controls, template alone and tenqjlate with the holoenzyme alone were also evaluated. 
Reactions were terminated by the addition of EDTA to a concentration of 25 mM and 
NaOH to a concentration of 50 mM. The reaction products were evaluated by 

20 electrophoresis at 2 V/cm tor 20 hours at room temperature through horizontal 0.7% 

alkaline agarose gels using 30 mM NaOH, 2 mM EDTA as the electrophoresis buffer. The 
gels were neutralized, dried and analyzed by autoradiography. 

The electrophoresis gels showed that incubation of the D-loop template, the 
seven primosomal proteins, SSB and DNA polymerase III holoenzyme resulted in extension 

25 of the invading strand oligonucleotide (42 nt, Seq. ID. No. 2) to the full length template size 

(6.4 kb). Tlie efficiency of the reaction varied, but generally 15-30% of the invading strand 
could be elongated to full length in a 10 minute incubation. Tlie reaction exhibited an 
absolute requirement for all of the primosomal proteins except PriC. Omission of this 
protein resulted in a decrease m DNA synthesis to one-tliird that of the complete reaction. 

30 This observation was similar to those reported for replication on different templates. Ng et 

al, / BioL CherrL 21 h 15642-15648 (1996). Some extension of the mvading strand by the 
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holoeiizyme alone could be observed, but tliis was suppressed by the presence of PriA. If 
the invading strand was oniitted fi-om the reaction, and [a-^^P] dATP was included, no DNA 
replication could be observed. 

EXAMPLE 4 

Because DNA helicases were being introduced to the DNA during 
primosome assembly, extension of the invading strand could result from one of two 
processes: either (1) assembly of a bona fide replication fork at the D loop followed by 
elongation of the leading strand coupled with unwinding of the duplex DNA ten5)iate, or (2) 
uncoupled unwindmg of the template DNA leaving an oligonucleotide annealed to the viral 
single stranded DNA that could be elongated in a primer extension reaction by the 
polymerase. We previously showed that coupled replication fork action requires a protein- 
protein interaction between DnaB and the t subunit of the holoenzyme. Kim et al.., Cell 84: 
643-650 (1996). In the presence of this interaction, replication forks could move rapidly, at 
nearly 1000 nt/sec, whereas in its absence, the polymerase becomes stuck behind a slow- 
moving helicase and replication fork progression proceeds at only about 30 nt/sec. 

To evaluate the mechanism active in the replication of DNA in the method of 
the invention, the speed of elongation of the invading strand was assessed in the presence 
and absence of t using holoenzyme reconstituted from individual purified subunits. Ten 
second time points were taken from the start of the reaction, and the elongated products 
were examined on denaturing gels. Full length material could be observed in the presence of 
T after 10 seconds, whereas even after 60 seconds no ftill length material was observed in its 
absence. This corresponds to a rate of replication fork progression in the presence of t of 
600-700 nt/sec, similar to what has been observed in the past for other replication systems. 
Mok et al.. 1 Biol Chenu 262: 16644-16654 (1987). Thus, we conclude that bona fide 
replication fork assembly occurs at the D loop on the template in the presence of 
primosomal proteins, SSB and the holoenzyme. 

EXAMPLE 5 

All of the phenotypes ofpriA nuU mutations can be suppressed by mutated 
priA alleles that encode PriA proteins that are no longer ATPases or DNA helicases, but stiU 
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catalyze primosome assembly. Zavitz et a!., / BioL Chem. 267: 6933-6940 (1992). These 
mutations are substitutions in the invariant Lys in the Walker A box nucleotide-bindiag 
motif. If the PriA-dependent replication fork assembly described here were relevant to what 
happened in the cell, we would expect these mutant proteins to substitute fully for wild-type 
PriA in the replication reaction. To test this, three mutant proteins, having the K230R, 
K230A and K230D substitutions were tested. All three supported replication on the D loop 
to a greater extent than the wild-type protein. This same type of inproved activity in the 
mutant proteins has been observed in other systems (Zavitz, supra), and may arise because 
the mutant proteins remain bound to the site of DNA binding, providing a better target than 
the wild-type protein that can move off the site because of its helicase activity. 



EXAMPLE 0 

E, coli strains carrying priA mutations are very dijfficult to grow. Tliey are 
rich-media sensitive, form huge filaments, and have a viability roughly one-hundredth that of 
the wild-type. Sandler et al., Genetics 143: 5-13 (1996); Nurse et al., / BacterioL 6686- 
6693 (1991); Masai et al., EMBO J. 13: 5338-5345 (1994). Suppressor mutations that 
restore viability, as well as ablate constitutive induction of the SOS response and the defects 
in homologous repair of UV-damaged DNA, arise overnight after transduction of the 
priA2:kan allele into fresh recipient cells. The mutations map to dnaC. (Sandler, supra). 
DnaC forms a complex with DnaB in solution (Wicker et al., Proc. Natl Acad ScL (USA) 
72: 921-925 (1975), and is required for the efficient transfer of DnaB to DNA in the 
presence of other replication protein. Marians et al., Ann. Rev. Biochem. 61: 673-719 
(1992). In order to assess the biochemical properties of these altered DnaC proteins, one 
such suppressor allele, dnaCSlO, was molecularly cloned into an expression plasmid and the 
mutant protein purified as described in Example 7, infra. 

Strains carrying dnaCSlO no longer require PriA tor viability. This suggests 
that if the essential role for PriA in cellular metabolism was to catalyze assembly of 
replication forks at recombination intermediates, DnaCSlO must be able to bypass the 
requirement for PriA to recognize the D loop and nucleate the assembly of a primosome. 
Accordingly, we tested whether DnaC810 alone could direct transfer of DnaB to the D loop 
template DNA. 
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In the presence of SSB aiid the holoenzyme, the combination of wild-type 
DnaC and DnaB did not support elongation of the invading strand of the D loop. On the 
other hand, DnaC8 10 was clearly able to load DnaB to the D loop on the template in the 
absence of the other primosomal proteins, as evidenced by the elongation of the invading 
5 strand to full length. Thus, the E176G substitution in DnaCSlO represents a true gain of 

function mutation that allows bypass of the DnaB loading pathway that involves PriA, PriB, 
PriC and DnaT and permits a reduction in the number of proteins necessary for the practice 
of the present invention. 

Interestingly, the relative efficiencies of the replication reaction catalyzed in 
10 the presence of DnaC8 10 and DnaB varied compared to the reaction catalyzed by the 

con[?)lete set of primosomal proteins. At 80 mM KCl, the DnaCSlO reaction was 5- to 10- 
fold more efticient. However, at 600 mM potassium glutamate, the reaction catalyzed by 
the complete set of proteins was more efficient by a factor of 2. While not intending to be 
bound by a particular mechanism, this difference may arise from differences in the relative 
15 stability of intermediate complexes that are formed during the loading of DnaB to DNA. 

EXAMPLE 7 

Construction of Plasmid pETl Ic-dnaCSlO—A dnaCSlO open reading 
frame (ORF) was made by two-step overlapping polymerase chain reaction (PGR) Morton 
20 et al., Gene 77: 61-68 (1989). The N-terminal coding region of dnaCSlO was PGR amplified 
using plasmid pETllc-rfnaC (Marians, K.J, Methods Enzymoi 262:m 507-521 (1995)) as a 
template and two flanking primers: 

(i) the Ndel primer (Seq. ID No. 3), wliich carries a Ndtl site at the dnaC initiator codon, 
and 

25 (ii) the Agel' primer (Seq. ID. No. 4), wliich cairies the designed point mutation (E176G, 
GAA-GGT). The G-terminal coding region of dnaCSlO was also PGR amplified using 
plasmid pETl Ic-dnaC as a template and two different flanking primers: 

(i) the Agel primer (Seq. ID No. 5), wliich is conplementary to the Agel' primer and 

(ii) tlie BamHI primer (Seq. ID No, 6), wliich carries a BaivHI site just downstream of the 
30 dnaC stop codon. These overlapping N- and G-terminal fragments were gel purified after 

PGR and further PGR extended and amplified with the two flanking Ndel and BamHI 
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primers. The gel purified dnaCSlO ORF fragment was digested with Ndtl and BawHl and 
ligated with Ndeh and fiaiiiHI-digested pETl Ic plasmid DNA to give pETl Ic-dnaCSlO. 

Purification ofDnaCSlO — Because of the extreme overproduction, 
DnaCSlO was followed during purification by SDS-PAGE. BL21(DE3)pLysS carrying 
pETl \c-dnaC810 was grown in 12 1 L Broth (Mainatis et al., Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY 
(1982)) containing 0.4% glucose and 300 mg/ml anpicillin to OD^oo = 0-4 and then induced 
in the presence of 1 mM IPTG for 3 li Cells were chilled, pelleted by centrifugation, and 
resuspended in 50 mM Tris-HCl (pH 8.4 at 4 °C) and 10% sucrose. The cell suspension (50 
ml) was adjusted to 150 mM KCl, 20 mM EDTA. 5 mM ditliiothreitol, 0.02% lysozyme, 
and 0. 1% Brij 58 and incubated at 0 °C for 10 min. This suspension was centrifuged at 
100,000 X g for 1 h (Sorvall T865 rotor). Tlie supematant (fraction 1, 65 ml, 3510 mg 
protein) was adjusted to 0.04% polymin P by dropwise addition of a 1% solution. The 
precipitate was removed by centrifiigation at 47,000 x g in a Sorvall SS-34 rotor for 30 nun. 
The supematant was ftirther subjected to (NH4)2S04 fractionation (50% saturation) by the 
addition of solid. The resulting protein pellet was collected by centrifiigation at 47,000 x g 
in a Sorvall SS-34 rotor for 30 mm. Tlie protein pellet was resuspended in 8 ml of buffer A 
[50 mM Tris-HCl (pH 7.5 at 4 °C), 1 mM EDTA, 5 mM dithiothreitdl. 20% glycerol, 0.01% 
Brij 58] + 50 mM NaCl to give fraction 2 (13 ml, 1 108 mg protein). Fraction 2 was dialyzed 
against 2 1 of buffer A + 50 mM NaCl for 12 h and then loaded onto a 100-ml DEAE- 
cellulose column (4 cm x 20 cm) that had been equilibrated previously with buffer A + 50 
mM NaCl The column was washed widi 200 ml of buffer A + 50 mM NaCl. Fractions (15 
ml) of the flow- through and wash that contained protein were pooled to give fraction 3 (81 
ml, 363 mg protein). Fraction 3 was loaded directly onto a 35-ml SP-Sepharose FF column 
(formed in a 60-mI disposable syringe) that had been equilibrated previously with buffer A + 
50 mM NaCl. Tlie column was washed with 200 ml of buffer A + 50 mM NaCl and protein 
was then eluted with a 350-ml linear gradient of 50-300 mM NaCl in buffer A. DnaC810 
eluted at 175 mM NaCl (fraction 4, 24 ml, 25 mg protein). Fraction 4 was then loaded 
dii'ectly onto a 6-ml hydroxylapatite column (packed in a lO-iiil disposable syringe) that had 
been equilibrated previously with buffer A + 200 mM NaCl. The column was washed with 
12 ml of equilibration buffer and protein was eluted with a 60-ml linear gradient of 0-400 
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mM (NH4)2S04m buffer A + 200 mM NaCl. DnaCSlO eluted at 150 mM (NH4)2S04to give 
fraction 5 (5.2 ml, 16.5 mg protein). Fraction 5 was concentrated bydialyzing against buffer 
A + 50 inM NaCl + 30% polyethylene glycol 20,000 and loaded onto a 125-nil Superdex- 
200 FPLC column that had been equilibrated with buffer A + 50 mM NaCl. The column was 
5 eluted at 1 ml/min. Fractions (1 ml) containing DnaCSlO were pooled to give fraction 6 (7.5 
ml, 9.2 mg protein). Fraction 6 was then loaded onto a 3-ml phosphocellulose colunm that 
had been equilibrated with buffer A + 50 mM NaCl. The column was washed with 6 ml of 
equilibration buffer and protein was eluted with a 60-ml linear gradient of 50-400 mM NaCl 
in buffer A. DnaCSlO eluted at 250 vcM NaCl (Fraction 7, 3.5 ml, 5.2 mg protein). 
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Sequejice Listing 

(Seq. ID No. 1) ACATACATAA AGGTGGCAAC GCCATTCGAA 

ATGAGCTCCA TATGCTAGCT AGGGAGGCCC 
CCGTCACAAT CAATAGAAAA TTCATATGGT TTACCAGCGC 

(Seq. ID No. 2) ATATAAAAGA AACGCAAAGA CACCACGGAA 

TAAGTTTATT TT 

(Seq. ID No. 3) TAATGCAGGC CATATGAAAA ACGTTGGCGA CCTG 

(Seq. ID No. 4) TCGTATTTCG AACCGGTCTG CACG 

(Seq. ID No. 5) CGTGCAGACC GGTTCGAAAT ACGA 

(Seq. ID No. 6) TTAAGCACTG GGATCCTTAA TACTCTTTAC CTGTTAC 



( 
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CLAIMS 



1 LA method for replication of a target region of a target DNA molecnle 

2 comprising the steps of: 

3 (a) introducing a D-loop into the target DNA molecule at a first initiation 

4 point adjacent to the target region; 

5 (b) assembling a replisome at the D-loop; and 

6 (c) providing DNA monomers and ATP to the replisome, whereby the 

7 target region is reproduced. 

1 2. The method of claim 1 , wherein the target DNA molecule is a duplex 

2 DNA. 

1 3. The method of claim 2, wherein the step of introducing a D-loop is 

2 performed by hybridizing the duplex DNA molecule with a first oligonucleotide primer 

3 which is substantially complementary to the first initiation site. 

1 4. The method of claim 3, wherein the first oligonucleotide primer has a 

2 length of from 20 to 50 bases. 

1 5. The method of claim 3, wherein the first oligonucleotide primer 

2 conqjrises a detectable label or capture moiety. 

1 6. The method of claim 3, further comprising the step of introducing a 

2 second D-loop by hybridizing the duplex DNA molecule with a second oligonucleotide 

3 primer which is substantially complementary to a second initiation site, said target region 

4 lying between the first and second initiation sites. 

1 7. The method of claim 6, wherein the fu*st and second oligonucleotide 

2 primers each have a length of from 20 to 50 bases. 
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1 8. The method of claim 6, wherein at least one of the oligonucleotide 

2 primers comprises a detectable label or capture moiety. 

1 9. The method of claim 6, wherein the replication is performed in a 

2 supporting matrbc. 

1 10. The method of claim 6, wherein the replisome is assembled via the 

2 action of primosomal proteins, single-stranded DNA-binding protem and tlie DNA 

3 polynaerase III holoenzyme. 

1 11. The method of claim 10, wherein the prinK)somal proteins includes a 

2 mutant PriA protein wliich lacks ATPase and helicase functionality. 

1 12. Tlie method of claim 2, wherein the replication is performed in a 

2 supporting matrix. 

1 13. The method of claim 1, wherein the replication is performed ia a 

2 supporting matrix. 

1 14. Tlie method of claim 1, wherein the replisome is assembled via the 

2 action of primosomal proteins, single-strand binding protein and holoenzyme III. 

1 15. The method of claim 14, wherein the primosomal proteins includes a 

2 mutant PriA protem which lacks ATPase and helicase functionality. 
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8. The method of claim 6, wherein at least one of the oligonucleotide 
primers comprises a detectable label or capture moiety. 



9. 



The method of claim 6, wherein the replication is performed in a 



supporting matrix. 

10. The method of claim 6, wherein the replisome is assembled via the 
action of primosonial proteins, single-stranded DNA-binding protein and tlie DNA 
polymerase III holoenzyme. 

11. The method of claim 10, wherein the primosomal proteins includes a 
mutant PriA protein wliich lacks ATPase and helicase functionality. 

12. The method of claim 2, wherein the replication is performed in a 
supporting matrix. 

13. The method of claim 1, wherein the replication is performed in a 
supporting matrbc. 

14. Tlie method of claim 1 , wherein the replisome is assembled via die 
action of primosomal proteins, single-strand binding protein and holoenzyme III. 

15. The method of claim 14, wherein the primosomal proteins includes a 
mutant PriA protein which lacks ATPase and helicase functionality. 
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<220> 

<223> Ndel primer 
<400> 3 

taatgcaggc catatgaaaa acgttggcga cctg 



<210> 4 
<211> 24 
<212> DNA 

<213> Escherichia coli 
<220> 

<223> Agel' primer 
<400> 4 

tcgtatttcg aaccggtctg cacg 



<210> 5 
<211> 24 
<212> DNA 

<213> Escherichia coli 
<220> 

<223> Agel primer 
<400> 5 

cgtgcagacc ggttcgaaat acga 



<210> 6 
<211> 37 
<212> DNA 

<213> Escherichia coli 
<220> 

<223> BamHI primer 
<400> 6 

ttaagcactg ggatccttaa tactcttcac ctgttac 
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